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AMENDMENT TO THE SPECIFICATION 



Please delete the all embedded hyperlink and/or browser-executable code and amend the 

specification paragraphs [0241] and [0716] as follows: 

[0241]A "comparison window", as used herein, includes reference 
to a segment of any one of the number of contiguous positions 
selected from the group consisting of from 20 to 600, usually about 
50 to about 200, more usually about 100 to about 150 in which a 
sequence may be compared to a reference sequence of the same 
number of contiguous positions after the two sequences are 
optimally aligned. Methods of alignment of sequence for 
comparison are well-known in the art. Optimal alignment of 
sequences for comparison can be conducted, e.g. by the local 
homology algorithm of Smith & Waterman, Adv. Appl. Math. 
2:48. sup. 2, 1981, by the homology alignment algorithm of 
Needleman & Wunsch, J. Mol. Biol. 48:443, 1970, by the search 
for similarity method of person & Lipman, Proc. Nat'l. Acad. Sci. 
USA 85:2444, 1988, by computerized implementations of these 
algorithms (GAP, BESTFIT, FASTA and TFASTA in the 
Wisconsin Genetics Software Package, Genetics Computer Group, 
575 Science Dr., Madison, Wis.), or by manual alignment and 
visual inspection. Other algorithms for determining homology or 
identity include, for example, in addition to a BLAST program 
(Basic Local Alignment Search Tool at the National Center for 
Biological Information), ALIGN, AMAS (Analysis of Multiply 
Aligned Sequences), AMPS (Protein Multiple Sequence 
Alignment), ASSET (Aligned Segment Statistical Evaluation 
Tool), BANDS, BESTSCOR, BIOSCAN (Biological Sequence 
Comparative Analysis Node), BLIMPS (BLocks IMProved 
Searcher), FASTA, Intervals & Points, BMB, CLUSTAL V, 
CLUSTAL W, CONSENSUS, LCONSENSUS, WCONSENSUS, 
Smith- Waterman algorithm, DARWIN, Las Vegas algorithm, 
FNAT (forced Nucleotide Alignment Tool), Framealign, 
Framesearch, DYNAMIC, FILTER, FSAP (Fristensky Sequence 
Analysis Package), GAP (Global Alignment Program), GENAL, 
GIBBS, GenQuest, ISSC (Sensitive Sequence Comparison), 
LALIGN (Local Sequence Alignment), LCP (Local Content 
Program), MACAW (Multiple Alignment Construction & Analysis 
Workbench), MAP (Multiple Alignment Program), MBLKP, 
MBLKN, PIMA (Pattern-Induced Multi-sequence Alignment), 
SAGA (Sequence Alignment by Genetic Algorithm) and WHAT- 
IF. Such alignment programs can also be used to screen genome 
databases to identify polynucleotide sequences having substantially 
identical sequences. A number of genome databases are available, 
for example, a substantial portion of the human genome is 
available as part of the Human Genome Sequencing Project (J. 
Roach, 
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http://weber.u.Washington.edu/.about.roach/human_genome_progr 
oss 2.html) (Gibbs, 1995). At least twenty-one other genomes have 
already been sequenced, including, for example, M. genitalium 
(Fraser et al., 1995), M. jannaschii (Bult et al, 1996), H. 
influenzae (Fleiscmann et al., 1995), E. coli (Blattner et al., 1997) 
and yeast (S. cerevisiae) (Mewes et al., 1997) and D. melanogaster 
(Adams et al., 2000). Significant progress has also been made in 
sequencing the genomes of model organism, such as mouse, C. 
elegans and Arabadopsis sp. Several databases containing genomic 
information annotated with some functional information are 
maintained by different organization and are accessible via the 
internet One example of a useful algorithm is BLAST and BLAST 
2.0 algorithms, which are described in Altschul et al., Nuc. Acids 
Res. 25:3389-3402, 1977 and Altschul et al., J. Mol. Biol. 
215:403-410, 1990, respectively. Software for performing BLAST 
analyses is publicly available through the National Center for 
Biotechnology Information. This algorithm involves first 
identifying high scoring sequence pairs (HSPs) by identifying short 
words of length W in the query sequence, which either match or 
satisfy some positive-valued threshold score T when aligned with a 
word of the same length in a database sequence. T is referred to as 
the neighborhood word score threshold (Altschul et al., supra). 
These initial neighborhood word hits act as seeds for initiating 
searches to find longer HSPs containing them. The word hits are 
extended in both directions along each sequence for as far as the 
cumulative alignment score can be increased. Cumulative scores 
are calculated using, for nucleotide sequences, the parameters M 
(reward score for a pair of matching residues; always >0). For 
amino acid sequences, a scoring matrix is used to calculate the 
cumulative score. Extension of the word hits in each direction are 
halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score 
goes to zero or below, due to the accumulation of one or more 
negative-scoring residue alignments; or the end of either sequence 
is reached. The BLAST algorithm parameters W, T and X 
determine the sensitivity and speed of the alignment. The 
BLASTN program (for nucleotide sequences) uses as defaults a 
wordlength (W of 11, an expectation (E) of 10, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the 
BLASTP program uses as defaults a wordlength of 3 and 
expectations (E) of 10 and the BLOSUM62 scoring matrix (see 
Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915, 1989) 
alignments (B) of 50, expectation (E) of 10, M=5, N=-4 and a 
comparison of both strands. 
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[07 16] While the invention has been described in detail with 
reference to certain Exemplary aspects thereof, it will be 
understood that modifications and variations are within the spirit 
and scope of that which is described and claimed. 

Sequence CWU 0 SQTB SEQUENCE LISTING The patent 
application contains a lengthy "Sequence Listing" section. A copy 
of the "Sequence Listing" is available in electronic form from the 
USPTO ~ web site 

(http://seqdata.uspto.gov/7pageRequest~ docDetail&DocID~US20 
080233 175A1) . An electronic copy of the "Sequence Listing" will 
also be available from the USPTO upon request and payment of 
the fee set forth in 37 CFR 1 . 1 9(b)(3). 
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